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Abstract 

The Frechet distance is a well studied and commonly used measure to capture the similarity of polyg- 
onal curves. Unfortunately, it exhibits a high sensitivity to the presence of outliers. Since the presence of 
outliers is a frequently occurring phenomenon in practice, a robust variant of Frechet distance is required 
which absorbs outliers. We study such a variant here. In this modified variant, our objective is to minimize 
the length of subcurves of two polygonal curves that need to be ignored (MinEx problem), or alternately, 
maximize the length of subcurves that are preserved (Maxin problem), to achieve a given Frechet distance. 
An exact solution to one problem would imply an exact solution to the other problem. However, we show 
that these problems are not solvable by radicals over Q. This motivates the search for approximate so- 
lutions. An approximate solution for one problem though will not provide any guaranteed approximate 
solution for the other problem. Thus, we present an algorithm, which approximates, for a given input 
parameter S, an optimal solution for the MinEx problem up to an additive approximation error 6 times 

the length of the input curves. The resulting running time is upper bounded by O log (f )^ , where 
n is the maximum complexity of the input polygonal curves. Furthermore, we present an approximation 
algorithm for the MaxIn problem that provides a (1 — 5)-approximate solution in O (^x^'^s(t)) time. 
This algorithm improves upon the time bound established by Har-Peled and Wang for this problem. 
Keywords: Frechet distance, similarity of polygonal curves, approximation, weighted shortest path. 

1 Introduction 

Measuring similarity between two polygonal curves in Euclidean space is a well studied problem in com- 
putational geometry, both in practical and theoretical setting. It is of practical relevance in areas such 
as pattern analysis, shape matching and clustering. It is of theoretical interest as well since the problems 
in this domain are fairly challenging and lead to innovative tools and techniques. The Frechet distance - 
one of the widely used measures for similarity between curves - is intuitive and takes into account global 
features of the curves instead of local ones, such as their vertices [H [3l [7] • Despite being a high quality 
similarity measure for polygonal curves, it is very sensitive to the presence of outliers. Consequently, 
researches have been carried out to formalize the notion of similarity among a set of polygonal curves 
that tolerate outliers. They are based on intersection of curves in local neighborhood llj, topological 
features [4| , or adding flexibility to incorporate the existence of outliers [7^ . In [7 , Driemel and Har-Peled 
discuss a new notion of robust Frechet distance, where they allow k shortcuts between vertices of one of 
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the two curves, where fc is a constant specified as an input parameter. They provide a constant factor 
approximation algorithm for finding the minimum Frechet distance among all possible /c-shortcuts. One 
drawback of their approach is that a shortcut is selected without considering the length of the ignored 
part. Consequently, such shortcuts may remove a significant portion of a curve. As a result, substantial 
information about the similarity of the original curves could be ignored. A second drawback of their 
approach is that the shortcuts are only allowed to one of the curves. Since noise could be present in both 
curves, shortcuts may be required on both to achieve a good result. For example. Figure [l^ shows two 
polygonal curves that both need simultaneously shortcuts to become similar. 

In this paper we discuss an alternative Frechet distance measure to tolerate outliers; it incorporates the 
length of the curves and allows the possibility of shortcuts on one or both curves. We consider two natural 
dual perspectives of this problem. They are outlined as follows using the common dog-leash metaphor for 
Frechet distance. 

Min-Exclusion (MinEx) Problem: If a person wants to walk on one curve and his/her dog on the other 
one, for a given leash length s > 0, we wish to determine a walk that minimizes the total length of all 
parts of the curves that need a leash length bigger than e. 

Max-Inclusion (Maxin) Problem: We are looking for a walk that maximizes the total length of all 
parts of the curves that a leash length less than or equal to e is sufficient. 

Observe that the solution for one problem leads to a solution for the other problem. An exact solution 
for these problems is presented in [5 , where the distances are measured using (more restrictive and 
much simpler) Li and Loo metrics. In Section [2] using Galois theory, we show that these problems are 
not solvable by radicals over Q, when distances are measured using L2-metric. (It is natural to study 
Frechet distance problems in L2-inetric, see e.g. |T].) This suggests that we should look for approximation 
algorithms. But now the two problems are distinct and an approximate solution for the MinEx problem 
does not provide any guarantee towards an approximate solution for the MaxIn problem, and vice- versa. 
An approximation algorithm for the MaxIn problem is outlined in [9j. To the best of our knowledge, nobody 
has previously studied approximation algorithms for the MinEx problem. In this paper, we provide an 
algorithm that provides a bounded approximate value for the MinEx problem. Also, as a byproduct, we 
improve the running time of the algorithm in 9J for the MaxIn problem. 
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a) Two polygonal curves Ti and T2. 



b) Free-space diagram for Ti and T2 



Figure 1: a) A possible solution is illustrated by the connecting lines between the parametrization for Ti and T2. The 
subcurves on both polygonal curves that should be ignored are illustrated by the blue, red and green subcurves on Ti and 
Tb. So, Q^(ri, T2) is the summation of the lengths of the colored subcurves and {Ti,T2) is that of the black subcurves. 
b) The solution corresponds to an a;i/-monotone path in the deformed free-space diagram F. In this space, {Ti,T2) can 
be measured by summing the lengths of its subpaths going through the forbidden space (shaded gray area), measured in 
the Li-metric (similarly for Q^{Ti,T2)). 



1.1 Preliminaries 

Let Ti : [0, 1] — > and T2 : [0, 1] — ^ be two polygonal curves. Their Frechet distance is defined 
as the minimum leash length required to walk only forwardly, in parallel on both Ti and T2, from the 
starting points to the ending points, at which the two walks could have different variating speeds. More 
formally, two monotone parametrizations ai, a2 : [0, 1] — >■ [0, 1] define, for each time i G [0, 1], a matching 
(Ti (ofi (t)) ,T2 (a2 (t))) of one point on Ti to exactly one point on T2 and vice-versa. The needed leash 
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length for the two parametrizations is defined as the maximum Euclidean distance of two matched points, 
over all times. Then, Frechct distance Sp {Ti,T2) is defined as the infimum of the required leash lengths 
over all possible pairs of monotone parametrizations [I] : 

6f (Ti, T2) inf max {jTi (ai (t)) (^2 (t)) |} , (1) 

ai,a2:[0a]->[04] te[0,l] 

where |.| denotes the Euclidean distance. For simplification, we say from now, that all considered 
parametrizations are monotone. The corresponding Frechet distance decision problem asks if there exist 
two parametrizations for a given leash length e, realizing a Frechet distance between Ti and T2 that is 
upper bounded by e. In other words, it asks if it is possible to walk your dog with a given leash of length 
£, such that you and your dog stay on your own curves. For a fixed leash length e, a pair of arbitrary 
points on the curves of Ti and T2 is called forbidden, if their Euclidean distance is bigger than e and 
otherwise, it is called free. We refer to a pair of parametrizations (ai,a2) for Ti and T2 w.r.t. e, as a 
possible solution for Ti and T2. Analogously to the matching of points on the curves of Ti and T2, we 
define a pair of parameters pi and p2 as forbidden (free), if the Euclidean distance between Ti (pi) and 
T2 {P2) is greater (equal to or less) than e. In this context, the Frechet distance decision problem asks, if 
there exists a parametrization, such that all corresponding matchings are free [I] . 

To decide, whether the Frechet distance between two polygonal curves is upper bounded by a given e, 
the free-space diagram is computed. The free-space diagram is a decomposition of the parameter space 
[0, 1] X [0, 1] of Ti and T2 into two sets. The first one is the forbidden-space, which is defined as the 
union of all forbidden parameter pairs. The second set is the free-space, defined as the complement of 
the forbidden-space. Let ni (respectively, 712) be the number of segments of Ti (respectively, T2) and let 
n = max(ni,n2). Since we consider the worst case running time, we assume, w.l.o.g., n = ni = n2- The 
free-space diagram is a rectangle, partitioned into n columns and n rows. It consists of v? parameter cells 
C*^^ , for i,j = 1, n, whose interiors do not intersect with each other. For each parameter cell C*^-', there 
exists an ellipse such that the intersection of the area bounded by this ellipse with C*'^ is equal to the 
free-space region of that cell [I . Free-space of C"'-' is denoted by and its forbidden space by C^g ■ We 
consider the free-space diagram in the Cartesian plane and assume that it lies axes aligned. We say that 
a point lies to the left (right) of another point, if its x-coordinate is smaller (greater) than the second one. 
Analogously, we say, that a point lies below (above) another point, if its y-coordinate is smaller (greater) 
than the second one. We define a point s dominates another point s', if and only if s' does not lie to the 
right or above s. 

For simplification, we say that paths, curves, edges, etc. are a;j/-monotone if they are nondecreasing 
in both X- and y-coordinates. We know that any pair of parametrizations (01,012) corresponds to an 
a;y- monotone path in free-space diagram, TTgt, connecting the bottom left corner of the diagram, s, to the 
upper right corner, i. So, deciding if the Frechet distance of Ti and T2 is upper bounded by e, is equivalent 
to the decision: Does there exists an a^y-monotone path in the free-space diagram for e, connecting s to t 
and avoiding the forbidden space [T]? 



1.2 Problem Definition 

Let Ti and T2 be two polygonal curves, each consisting of at most n line segments; and e > be a constant. 
For the MinEx (respectively, Maxin) problem, the quality of a solution is the sum of the lengths of the 
subcurves of Ti and T2 that lie in the forbidden (respectively, free) space (see Figure [I] for an example). 
Formally, for a given pair of parametrizations (oi, 02), let Sq^q.^ C [0, 1] be the closure of the set of times 
such that the corresponding parameter pairs are forbidden, and Wa^a2 ^ [0, 1] be the closure of the set of 
times such that the corresponding parameter pairs are free. We define the quality of a solution (01,02) 
for the MinEx problem Qf^^^ and for the MaxIn problem QS^q,^ as follows (see Figure [l|i): 

Qf,«, / Ti (ai (t)) dt+ f T2 (02 (t)) dt 

QZc. / Ti(oi(t))dt+ / T2{a2{t))dt 
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We call (ai,a2) optimal if it minimizes (respectively, maximizes) Qa^a2 (respectively, Q^q,^) and define 
the quality ofTi and T2 w.r.t. e as its value, i.e., 

Qi,a2:[0,l]-i-[0,l] 

Q'^iTuT^) sup QZa. ^"^^ 

Qi,a2:[0,l]-f[0a] 

This means that the quality of Ti and T2 is the minimum (respectively, maximum) sum of lengths of 
curves on Ti and T2 to be ignored (respectively, matched), to obtain a Frechet distance not greater than e. 
For a given e, we would like to find a solution whose quality is not much worse than an optimal solution. 
To do so, we transform this problem setting into a weighted xy-monotone path problem, between s and 
t, in the free-space diagram of Ti and T2. To measure the sum of the lengths of the subcurves of Ti and 
T2 directly in the free-space diagram, we stretch and compress the columns and rows of the diagram, 
such that their widths and heights are equal to the lengths of the corresponding segments. We call the 
resulting diagram the deformed free- space diagram and denote it by F. To solve the MinEx (respectively, 
Maxin) problem, we look for an xy-monotone path n^i C F from 5 to t, where our goal is to minimize 
(respectively, maximize) the length of TTji, lying in the forbidden (respectively, free) space of F. Note that 
the length of the polygonal curves Ti and T2 corresponding to the parts of TT^t lying in the forbidden or 
the free space equals the length of ttsi measured under Li-metric in the corresponding space. We have 
the following observation. 

Observation 1. Let Ti and T2 be two arbitrary polygonal curves in and let F be the corresponding 
deformed free-space diagram for a leash length of e. Let tt^^ <Z F be a path corresponding to a pair of 
parametrizations (01,02) o/Ti andT2 w.r.t. e. Then, the sum of the lengths of the forbidden (respectively, 
free) paths of tTsi, measured under Li-metric, is equal to Qaia2 (i^espectively, QJ^q^J. 

Now it is easy to see that the MinEx and MaxIn problems are transformed to the following path 
problems. 

Weighted shortest xy-monotone path (wShortMP) problem: Compute an xy-monotone weighted 
shortest path from s to t in F, where the weight in the forbidden-space is one and the weight in the 
free-space is zero. The length of a path is defined as the sum of the lengths (measured in Li-mctric) of 
the part of the path lying in the forbidden space. 

Weighted longest xy-monotone path (wLongMP) problem: Compute an xy-monotone weighted 
longest path from s to t in F, where the weight in the forbidden-space is zero and the weight in the 
free-space is one. The length of a path is defined as the sum of the lengths (measured in Li-metric) of 
the part of the path lying in the free space. 



1.3 New results 

In Section [2] we establish that the MinEx and MaxIn problems are not solvable by radicals over Q. 
This is proved using Observation [T] and showing that the wShortMP problem is unsolvable within the 
Algebraic Computation Model over the Rational Numbers (ACMQ). In this model we can compute exactly 
any number that can be obtained from the rationals Q by applying a finite number of operations from 

— , x,-^, y, for any integer /c > 2 The proof is based on Galois theory. Motivated by that, we 

turn our attention towards approximation algorithms for the MinEx and the MaxIn problems. 

In Section |3) we transform the MinEx problem to the wShortMP problem, which in turn is transformed 
to a shortest path problem in directed acyclic graphs (Lemma [s]). We propose an algorithm that ap- 
proximates the weighted xy-monotone shortest path up to an additive error. This error is related to the 
lengths of the curves Ti and T2. The running time of this algorithm is O (^ji^ (Theorem jij) . In Section 

|4| we improve this running time to O log (^)^ (Theorem jsj). To do so, we solve a subproblem related 

to forming a 'small' graph over a convex set of points that preserves Li-distances between certain pairs 
of points. Furthermore, the solution to this problem also improves the running time of the algorithm in 



[9] for the MaxIn problem, from O to O (^^ log (f )) (Tb 

discuss why FPTAS for the MinEx problem may not be feasible, thougE 
is fairly straightforward to achieve. 



4). In Section 



we conclude and 



he FPTAS ofthe MaxIn problem 
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2 Unsolvability of MinEx and Maxin Problems 



Observation [I] implies that solving the MinEx (respectively, MaxIn) problem is equivalent to finding a 
weighted shortest (respectively, longest) xy-monotone path, connecting s and t in F, where the forbidden 
(respectively, free) space is weighted with 1 and the rest with 0, i.e., solving the wShortMP (respectively, 
wLongMP) problem. The difficulty is that these weighted path problems need to be solved through 
obstacles which have curved (elliptical) boundaries. In Theorem [l] we prove that the wShortMP problem 
is not solvable within the ACMQ. The strategy to prove Theorem [T] is as follows. From Galois theory 
(refer to [5]), we know that there is no general formula to solve polynomial equations of degree d> b using 
radicals. However, there are some polynomial equations of degree d > 5 that can be solved by radicals. 
It is the Galois group Gal(p) of an irreducible polynomial p over Q that determines the solvability of the 
equation p{x) = by radicals. Indeed, p{x) = is solvable by radicals over Q if and only if Gal(p) is 
solvable (refer to [S| for an introduction to solvable groups) . In the proof of Theorem [Tj we provide an 
example of two trajectories for which both the length and the coordinates of the bending points of any 
weighted shortest xy-monotone path cannot be computed by radicals. We reduce such a computation to 
the solution of a polynomial equation of degree 8. We show that the Galois group of this polynomial is 
the symmetric group over 8 elements, which is not solvable. 

Theorem 1. LetTi andT2 be two polygonal curves. Denote by s (respectively, by i) the bottom left corner 
(respectively, the upper right corner) of their deformed free-space diagram F. Let n^i be any weighted 
shortest xy-monotone path (with respect to Li-metric) from 5 to i. In this setting, the wShortMP problem 
is unsolvable within the ACMQ, i.e., in general, both the length and the coordinates of the bending points 
of TTgt cannot be computed by radicals. 

Proof Take Ti = abc, T2 = de and e = 1, where a ^ (0,0), b = (1,0), c = (-1,-^), d = (-i, |), 
e = (|, ^) (see Figure [2^). Hence, in the deformed free-space diagram, s — (0, 0) and t = (|f^, ^) (refer 
to Figure ^) . The parametric equations of ab, be and de are respectively 

Cell ab - de Cell be - de , 
f • • I 




Figure 2: a) Two polygonal curves, Ti and T2. b) The deformed free-space diagram F for Ti and T2. The dashed line is 
a weighted shortest xy-monotone path from 5 to i. The dotted line is a weighted shortest a;j/-monotone path that crosses 
£1 but not £2- 



ab: a + —Ab - a) ^ m(1,0) (0 < it < 1), 
\ab\ 
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In cell ab-de, we have the ellipse 



24 /7 3^^ 



where < a; < ^ and < y < ||. In cell bc-de, we have the ellipse 

^ /480, 24 3\^ 

£2: { (x-l)-\ y + 

^481 ^ 25^ 2 J 



^ ^ 31 , ^, 7 , 

(.T - 1) H V + - ] = 1, 

481 ^ 25 ^ 4 / 



where |||| < x < ^ and < y < |||| . Since s £ £1 , then either 

1. TTst crosses £1 but not £2 If Tr^t crosses £1 but not £2, we find the following optimal path by elemen- 
tary calculus. Travel in £1 from s to [jg (35a/2 — 43) , ^ (l4\/2 — 15)) and then to 1 (outside of £1 
and outside of £2). The length of this path is ^ (2851 - 1200%/2) « 4.80810. 



2. TTjt crosses both £1 and £2- If '"'st crosses both £1 and £2, then Tr^t must exit £1 at the same height 
it enters £2 since is inclined towards £2- Let h be this height and let h' be the height at which 
TTst exits £2- By elementary calculus, we find that for an a;j/- monotone path to be shortest, we need 
to have h = h' . 

The length of such a shortest path can be expressed in the following way: 

1591 49 V4375 - 4200/i - 784h^ \/165296875 - 212680800/i - 27373824/1^ 
^40" ^25 Too 12025 ' 

for an h to be determined. If we look for the values of h for which the derivative of this last expression 
is 0, we find that h must be a solution of 



p{h) = 585090042379589947534557557525634765625 - 3039825965000401080955586792871093750000 /i 
+5307213095548843266935155031210937500000 - 2973595218630130711131340711267500000000 
-649444075888789852190828979088700000000 + 562445109533777824218782819614464000000 
+193996238215889538903991144689745920000 + 21705929355568145355212682312548352000 h'^ 
+826789346560923302640987287586865152/1*^ = 0. 

By numerical methods, we find h « 0.50696 and the length of the corresponding path is approxi- 
mately 4.59277, so it is a global weighted shortest a;y-monotone path. 

Notice that p is irreducible over Q, since p is irreducible modulo 101 (and 101 does not divide the 
leading term of p), see 8J. Moreover, by using a simplified version of Bajaj's lemma (refer to O [2]), 
we can prove that the Galois group of p is the symmetric group over 8 elements, which is not solvable. 
Hence, p{h) = is not solvable by radicals. Consequently, both the length of Tr^i and the coordinates of 
its bending points cannot be computed by radicals. □ 

Combining this with Theorem [T] and Observation [T] we have the following result. 

Corollary 1. // is not possible to design an algorithm that can exactly solve the MinEx (or Maxin ) problem 
within the ACMQ. 



3 An Approximation Algorithm for the MinEx Problem 

In this section, we present an approximation algorithm with an additive error for the MinEx problem. The 
input to the problem consists of two polygonal curves Ti and T2, an arbitrary fixed leash length e > 
and an approximation parameter S > 0. We want to compute a pair of parametrizations (ai,Q!2) and its 
quality Qf^^^ ^'^^ ^1 ^^'^ ^2, such that Qf^^^ is a good approximation of {Ti,T2). We also want to 
construct two polygonal curves, T{ and such that 6f {T[,T2) < e, that correspond to a solution for 
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the MinEx problem. We abbreviate the deformed free-space diagram by F, its free-space by and the 
forbidden-space by B. Recall that F consists of 0{n^) cells and each cell is a rectangle, whose free space 
is (a portion of) an ellipse. Our approach is as follows. 

We have seen in Section [TT2] how to transform the MinEx problem into the wShortMP problem in the 
deformed free space diagram F. To design an approximation algorithm for the wShortMP problem, we will 
define a graph G over F, and show that for each path Tr^i in F, there exists a path Tr^t in G, which stays 
close to TTst. Thus, paths in G approximates ccy-monotone weighted shortest paths in F. Once we have a 
shortest path in G, it will be fairly straightforward to embed this path in F, and deduce an approximate 
solution for the MinEx problem. Though we are casting a geometric problem into a combinatorial setting, 
to simplify our notation, we may refer to a point p Cz F as also a vertex p in G - the meaning will be clear 
from the context. We construct G as follows. 

Step 1: Construct F. First compute the free-space diagram using the algorithm of Alt and Godau [T], 
then stretch the columns/rows of the diagram, such that widths/heights are equal to the length of the 
corresponding segments to obtain the deformed free-space diagram F. 

Step 2: Construct Grid. Add j additional equidistant vertical and horizontal grid lines to F (Figure 
|3|. Find the intersection of every vertical grid line £y. with the boundary of each ellipse. For each of these 
intersection points, add a new horizontal intersection line, passing through that point. Perform analogous 
steps for every horizontal grid line £hi- For each intersection point add a vertical intersection line, passing 
through that point. 

Step 3: Construct G. Compute the arrangement A induced by all of the grid lines, the intersection 
lines and the boundary of ellipses. The vertices of G are the vertices in A. For each edge (p, q) in A, if 
either or is xy-monotone then add the corresponding directed edge into G. The weight of this edge 
is equal to its length in Li-metric if it is lying in the forbidden space, otherwise it is zero. 



t 




Figure 3: An arrangement of lines and the boundary of an ellipse. Blue solid lines are grid lines and dotted dark red 
lines are intersection lines. Crosses represent vertices on the boundary of the ellipse resulting from intersections of the 
ellipse with the lines. Circles show some of the vertices on the intersection of (grid and intersection) lines. All vertices on 
a line are connected by directed edges (colored orange arrows) preserving a;j/-monotone ordering. Also, two vertices on the 
boundary of the ellipse are joined by an edge (also colored orange), if they are consecutive and the edge is xy-monotone. 

Observe that G is acyclic as all of its edges are directed and are xy-monotone. Compute a weighted 
shortest path from s to i in G, and output its corresponding geometric embedding as the desired approx- 
imate solution. In the next three lemmas, we establish that G can be used to provide an approximation 
algorithm for the wShortMP problem. First we need the following definition. A vertex s' is directly dom- 
inated by the point s G _F, if and only if, s' is dominated by s and there exists no vertex in G in the 
interior of ss'. 

Lemma 1. Let -Est C W be an xy-monotone shortest path from s to t, connecting s on a grid line Is with 
t on another grid line £t inside a parameter cell. Furthermore, let s' (z £s ( respectively, t' d £t) be a vertex 
from G directly dominated by s (respectively, t). Then, there exists a path its't' in G from s' to t' , such 
that TTs't' C W. 
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Proof. Since s E W and s' is directly dominated by s, we know s' G W {s could be equal to s'). We 
discuss the following four cases: 

a) s and t lie on two vertical lines and i^^-^ (Figure Bk)). 

Let a be the intersection point of i^' with We walk on the horizontal line £g> 9 s' , until we 

leave f at a point x G d£ or we reach a. If we encounter a first, we are done. Assume we first 
reach x. This implies that the gradient of d£ in x is positive and so its tangent line has to be 
a;2/-monotone. Since TTst C W, it follows that d£ is squeezed between T^, tTsj, i'^ and i'^^.i- We denote 
this region by R. The boundary d£ cannot turn back, as this would contradict the convexity of £. 
This implies that we can start at x, follow d£ using only xy-monotone edges, stay in R and reach 
£■^1 at a point y € R. Since all the utilized edges lie in W, the final path is in W. 

b) s and t lie on two horizontal lines £'l and £!l^i (Figure Wb)). 
Use the same argument as in a), but first walk upwards. 

c) s e and t e £^ (Figure [i];)). 

Use the same argument as in a), but first move to s and then walk upwards. 

d) sei'l and t e £'"j (Figure [ijl)). 

Use the same argument as in a), but first move to s and then walk to the right. 

After reaching the point y E d£ D £t, walk on £t from y towards t, until the closest vertex t' lying below 
and to the left of t is reached. Since t,y G W, that guarantees again t' G W. □ 




a) s e £^ and t e £:['+i. h) s e i'^ a.nd t e £'1+^. c) s € £^ and < G d) s e and i G 

Figure 4: Four cases of configurations for s and t, and their corresponding grid lines that are used in the proof of Lemma 


For the next lemma we need the following notation. Each parameter cell C*'-' of F defines a rectangle, 
having the left side C]^-' , the right side C]^"' , the bottom side Cg-' and the top side Cj.-' . Furthermore, let 
||7r|| be the weighted length of a path tt in F, measured using the Li-metric. Let T be a polygonal curve 

fc 

composed of k segment, T — {toti,tit2, ■ ■ ■ ,tk-itk), then the length of T, |r| — ^ |ii_iti|. 

1=1 

Lemma 2. Let C^'-' he an arbitrary parameter cell of F. Let -Est be a shortest xy-monotone path in C^'\ 
connecting a point s € C|^'"' U C]^"' with an arbitrary point t € C^"* U C^^ . Let £s ( respectively, £t ) he 
the side of C^'^ on which s (respectively, t) lies (see Figure^. Let s' G £s (respectively, t' G £t) be a 
vertex directly dominated by s (respectively, t). Then, there exists an xy-monotone path t^s'Vj such that 
W^s-t'W < ||7r,t||+8f -maxllTiUTsI}. 

Proof. We will construct a path Tr^'f C G such that it connects s' and t', and it passes through W 
whenever iTst passes through W. This guarantees that the weighted length of TTj/t/, in Li-metric, is not 
"much" bigger than that the weighted length of ngt- Let £ be the ellipse describing M^in C^'^ . Since £ is 
convex and Tr^t is a shortest path, it implies that there exists at most one entrance point a into £ and at 
most one exit point b from £. 

Let Va be the first point on tTsj after a (w.r.t. Tr^t) that is on a horizontal or vertical grid line £a (see 
Figure [5]). Let v'^ be the vertex directly dominated by Va- Analogously, we define Vh as the last intersecting 
point on Tr^t with a horizontal or vertical grid line £b, lying before b. We denote by G £h the vertex 
that is directly dominated by Vb- Let tti, ...,7rfc be the sub curves between Va and Vb on tTsj, separated by 
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grid lines. Their concatenation is the curve T^v^vt connecting Va and Vf, on tt^^. We know that TTat C W, 
so TTi C Wfor aU i — I, fc. Let t^v'^v'^ C WCi G be the concatenation of the paths obtained by applying 
Lemma [l] on each tt^. The two remaining sub-curves of nst (one before and one after TTvaVt) ^re passing 
through obstacles. The parts in the cells enclosing a and b are exceptions. Let Wa be the entry point of 
TT^t to the grid subcell that contains a. Let w'^ be its left bottom corner vertex. Since s and s' lie in the 
same grid cell, it follows: 

|ss'U<^.max{|riUr2|}, (4) 

where \ss'\x denotes the length of the projection of ss' on the x-axis. Furthermore, is dominated 
by Wa and the weighted shortest connecting xy-monotone path tTs^^ between s and Wa lies completely 
inside B. Thus, it follows that | ^ I . Combining this with (HI) , we obtain an xy- monotone 

weighted shortest connecting path tTs'w'^ from s' to w'^, such that UtTs/^uJI < UTr^iUall +2^ 'maxHTil, IT2I}. 
We do not know how large the part of T^WaVa ^ti^^ passes through B oy W is. Hence, we assume that 
T^WaVa C W. However T^WaVa is enclosed only by one grid cell (the one that contains w'^ and v^)- Thus, 
it follows that there exists an xy-monotone path tTu,'!,' (corresponding to T^WaVa ^-i^d connecting with 
v'a), for which HtTid^uJI < 2^ • max{|Ti|, 1X2!}. In the same way, we construct the paths ^v'^w'^^ and n^'^t'^ 
such that the concatenation iis't' of all these four paths is not much longer compared to ||7rst||, i.e., 
Il^f.'t'll < lk.t|l+8^max{|ri|,|T2|}. □ 

Lemma 3. The graph G has a complexity of O (fr)- ^/ ""st C F (respectively, n^i C G) is a weighted 
shortest xy -monotone path in F (respectively, G) then ||vr5t|| < ||7r5t|| < ||7r5t|| + (5 • max{|Ti|, IT2I}. 

Proof. Denote the sequence of grid cells that Tr^t intersects from s to t by ci,...,Cfc. Let Tr^t be the 
concatenation of the paths obtained by applying Lemma [2] with 5 := ^ on each c^. The error made in 
each cell is therefore upper bounded by ^ • max{|Ti|, \T2\}. Since Tr^t passes through at most 2n cells, it 
follows that \\TTsi\\ < ||7fst|| < Ikstll + S ■ maxjlTil, IT2I}. Each of the grid lines can intersect at most 
2n ellipses, thus requiring the addition of at most ^^^^ additional intersection lines. So, the arrangement 
of all these lines has a complexity of O ^^^^ □ 

The above lemma shows that for the wShortMP problem, Tr^i C G approximates ttsi C F. Next, 
we show how to derive an approximate solution for the MinEx problem, given Tr^f The path Tr^t passes 
through a sequence of parameter cells. For each edge in n^i, find its embedding in F. Since each of these 
embedded edges are xy-monotone and the end vertex of one edge is the start vertex of the next one, this 
results in an xy-monotone path from s to t in F. Let Tr^t be the weighted shortest xy-monotone path 
connecting s to t. From Lemma [s] it follows that ||7rst|| < ||7fj,t|| < ||7rsi|| + 6 ■ max{|Ti|, IT2I}. Since Tr^t 
is a concatenation of segments, it can be directly transformed into two corresponding parametrizations 
fii and a2- Since ||7rjjt|| < Ht^siH + ^ ' max{|ri|, 1121}, the approximation quality of the solution (cii,Q!2) 
follows from Observation [l] Let TTab be a maximal subpath of Tr^t, passing through the forbidden-space and 
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connecting the points a and b, which he both on the boundary of the free-space. These points correspond 
to two matchings (ai, 02) and (bi, 62) of two points lying on Ti and T2. Since both he on the boundary of 
the free-space, it follows that \aia2\ — e and |6i52| — £• This implies that the Frechet distance between 
aibi and 02^2 is not greater than e. We exchange the curve between ai and bi on Ti by the segment aibi 
and we proceed analogously for the curve between 02 and &2 on ^2 using the segment 0262- With these 
substitutions for all maximal subcurves of ttsj, passing through the forbidden-space, we obtain two new 
polygonal curves T{ and T2, with Sp 12) < e. Observation [l] implies, that the sum of the lengths of the 
removed subcurves on Ti and T2 is equal to Qfj^^.^- So, the sum of the lengths of the substituted curves 
does not exceed Q^{Ti,T2) + S ■ max{|Ti|, IT2I}. The running time follows directly from Lemma [sj by 
running the linear time algorithm for finding a shortest path in a directed acyclic graph. We summarize 
our result for the MinEx problem in the following theorem. 

Theorem 2. Given two polygonal curves Ti and T2 in the plane, an arbitrary fixed e > and an 

4 

approximation parameter S > 0, we can compute in O(^) time a pair of parametrizations ((5i,Q!2) and 
Its quality Q?^^^ for Ti and T2, such that Q^[Ti,T2) < Qf^^, < Q^(Ti,T2) + 6 ■ max{|Ti|, IT2I}. 
Furthermore, we can construct two polygonal curves, T[ and T2, realizing Q^^g.^, such that Sp {T[, Tj) < e, 
if the distances between starting and ending points of Ti and T2 are not greater than e. 

4 Improvements for the MinEx and Maxin problems 

First, we present an abstract problem and then use it to improve the solutions to the MinEx and MaxIn 
problems. Suppose is a convex region and P is a set of n points on the boundary of fi, denoted d^. 
Two points pi and p2 are said to be xy-monotone if piP2 is a;y-monotone, in which case we write pi \ P2- 
Our goal is to construct a directed graph G'{P), where the vertices of G'{P) are Steiner points together 
with the points in P satisfying the following. For each pair of points Pi,Pj G P such that Pi f Pj: there 
exists a directed path in G' between the vertex corresponding to pi to the vertex corresponding to pj and 
the cost of this path is where is the length of pipj in Li-metric. We can accomplish this 

task by examining (2) pairs of points from P. However, we show that G'{P) can be constructed with the 
aid of Steiner points in 0{n\ogn) time and size. Our method is based on the following simple geometric 
observations [10]. 

Observation 2. Let a (respectively, b) be a point in the South- West (respectively, North-East) quadrant 
of the Cartesian coordinate system. Then there exists an Li-shortest path from a to b that passes through 
the origin. 

Observation 3. Let a and b be two points such that a 'I b. Any xy-monotone path from a to b lies inside 
the bounding box of a and b. Furthermore, all xy-monotone paths from a to b, have the same length in 
Li-metric. 

We compute G'{P) as follows. Initialize the vertex set of G'{P) to be P. Then add the following 
Steiner points. Compute the vertical median line m, splitting P into at least — Ij points to the left 
and at least [-^^ — Ij points to the right of m, respectively (Figure [g^). Let mi be the upper and m2 be 
the lower intersection points of to H dfl. Denote by £1 and £2 the horizontal lines containing toi and TO2, 
respectively. Add toi and TO2 as Steiner points to the set of vertices of G'{P). Partition P into three sets 
as follows. Let Pabove be the set of points lying above £1, Pbeiow be the set of points lying below £2 and 
let Pmiddie ^ P \ (Pbeiow U Pabove)- For all the poiuts Pi e Pmiddie, computc their orthogonal projection 

onto TO. Add p™ as a Steiner point. Moreover, add the edge to G"(P), directed with respect 

to xy-monotone order. This projection onto m implies an ordering among the points in Pmiddie, with 
respect to increasing y-coordinate. Let {pi,...,pk) = Pmiddie be this ordering. For i = l,...,fc— 1, add 
the edges {p™,Pi]^i) to G'{P), directed with respect to a;y-nionotone order. Also add the edges (m2,p™) 
and (p™,TOi). For all vertices pi e Pabove (respectively, pj e Pbeiow), add the edge {mi,pi) (respectively, 
(pj,m2)). The weight of each edge is its length in Li-metric. Let Pieft (respectively, Pright) be the set of 
elements of P lying to the left (respectively, right) of to. Recursively apply the construction for Pieft and 

Pright- 
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(a) (b) 

Figure 6: (a) The point set P is partitioned witli respect to its median line m. Each pi £ Pmiddie is projected onto m (blue 
arrows and red disks). The projections are ordered with respect to their y-coordinates (orange arrows). Each pj G Pabove 
(respectively, Pj G Pbeiow) is connected to mi (respectively, 7712) (dark green arrows), (b) Each v £ Vge is connected by 
directed ST^-monotone edges (light green edges) to all four sides of dC^'^ . 



Lemma 4. Let Q be a convex region and P be a set of n points on its boundary. For each pair of points 
Pi,Pj S P such that Pi t Pj> there exists a path i^p^.p- in G' {P) between the vertex corresponding to Pi to 
the vertex corresponding to pj, and its cost is \piPj\i. Furthermore, the complexity of G' {P) is 0{n\ogn). 

Proof. The points pi and pj are separated by a median line in one of the recursive calls. Since pi f Pj, 
the projection of pi is below the projection of pj on that median line. Thus, they are connected by an 
a;y- monotone path TTp.^p. in G' . The cost of this path follows from Observation pi The recursion depth is 
at most O (log n) as the problem is partitioned with respect to the median. In each call, a linear number of 
edges and Steiner points are added with respect to the size of the input. Thus G' has O (nlogn) vertices 
and directed edges. □ 

Improvement for the MinEx problem: In Section [3j we showed how to find an approximate solution 

to wShortMP in O (js2^ time. To do so, we defined a neighborhood graph G = {V, E) of size O (72)1 

computed a shortest path tt in G and proved that tt approximates the solution to wShortMP. In this section, 

we show how to compute an approximate solution from a neighborhood graph G* of size O log ( j)^ 

The graph G* is defined as follows. For each parameter cell C*'^ with ellipse £, we restrict V to the 
boundaries of C''^ and £. Formally, let Vqc^.j = dC''^ n V and Vqs ^ dSnV. The vertex set of G* for 
this cell is V^p^.j U Vge U V , where V is the set of vertices of G'{Vd£) as defined above. 

Lemma 5. The size of Vgc-^ UVqsUV' is 0{1 log (f )) . 

Proof. Based on the construction of G, in Section [3j there are ^ equidistant vertical and horizontal 
grid lines. In the worst case all of them are intersecting dC^'^ and d£. Also for each intersection point 
on d£ there exists an intersection line. So, in the worst case, the total number of vertices of G on 
dC''^ and d£ (i.e., Vqc^., U Vqs) is = O (f). Moreover, since V is the set of vertices of G'iVas), 
based on Lemma Wl the complexity of G'iVds) is O[jlog[j)). So, the size of Vgci.j U Vge U V is 
0(t)+0(flog(^))=0(tlog(f)). □ 

For each parameter cell C*'^ with ellipse £, we define the set of edges of G* as follows. For each 
V & Vgs, there is an edge between v and each of its projections on . Each of these edges are directed 
with respect to the xy-monotone order and weighted by the length of their intersection with the forbidden 
space (Figure [6]d) . Moreover, each edge in G'{Vd£) is an edge in this cell with weight zero. Finally, G* is 
the union of all the graphs defined for each cell. 



Lemma 6. The size of G* is O log (f ^ 
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We now explain how to compute an approximate solution to wShortMP by using G* as a neighborhood 
graph. 

Lemma 7. LetiTs't' d G be a weighted shortest path connecting two points on the boundary ofG^'^. Then, 
there exists a path 7?*,^, in G* , such that ||7f*/j/|| < ||7rs't'||- 

Proof. Let £ be the ellipse corresponding to C"'-'. Since iTs't' corresponds to an xy-monotone path, there 
exists at most one vertex a (respectively, b) where iTst enters (respectively, exits) £. So, iTs't' consists 
of three sub-paths ng'a C B, iVab C W and Tr^t/ C B, whose concatenation is TTs't'- Here, a is defined 
as the last (respectively, first) vertex of TTs'a (respectively, iTab) and b as the last (respectively, first) 
vertex of TTab (respectively, TTbf). Since tTs'o is xy-monotone, it follows that a dominates s' . Moreover, a 
dominates the orthogonal projection of a onto the grid line on which s' lies. This implies that there exists 
a path TT*,^ connecting s' and a. Since Tr^'a C B and both paths have the same ending points, it follows 
||7fs'oll ^ ll^s'all- Since nab is xy-monotone, we have a t ^- It follows from Lemma |4] that there exists 
a path TT*f^ C W. This implies that ||7f*^|| = (because the weight in free-space is zero). Analogously, 
to the construction of 7?*,^^ it follows, that there exists a path in G* , such that 117?^*' 1 1 — ll^'^bt'll- The 
concatenation of the three constructed paths results in the required path. □ 

The following theorem is an improvement over Theorem [2] The proof is the same except that for each 
parameter cell, we apply Lemma [7] instead of Lemma [3j 

Theorem 3. Given two polygonal curves Ti and T2 in the plane, an arbitrary fixed e > and an 
approximation parameter 5 > 0, we can compute in O log (j)^ time a pair of parametrizations (ai, 0:2) 

and Its quality Q?^~^ for Ti and T2, such that Q^(Ti,T2) < Qi^s.^ ^ Q^(Ti,T2) 5 ■ max{|Ti|, IT2I}. 
Furthermore, we can construct two polygonal curves, T[ and Tj, realizing Q^^a^, such that Sp {T[, Tj) < e, 
if the distances between starting and ending points of Ti and T2 are not greater than e. 

Improvement for the Maxin problem: Har-Pcled and Yusu [9 proposed an (1 — (5)-approximation 
algorithm for the MaxIn problem whose running time is O {jr^ : where n is the size of the input polygonal 
curves. For each parameter cell C*'^ with free-space £, they place -g^^ Steiner points uniformly on d£. 
After projecting the Steiner points onto the four sides of C*'-', they use dynamic programming to find an 
optimum path between them inside this cell. Their algorithm performs O operations per cell. We 

improve their solution by proving that it is sufficient to perform O (f log {j)) operations for each cell. 

Similarly to our improved solution to the MinEx problem, we construct a neighborhood graph G* . 
Here though, for each parameter cell C*'-' with free-space £, the set of vertices and edges of G* is defined 
as follows. Let Vos be the set of equally spaced Steiner points on d£ as in the algorithm of [9]. Each 
V e Vds is connected to its four orthogonal projections onto the sides of C*^-' . Also, every two consecutive 
projection points on the boundary of C"'-' are connected by an edge. For vertices in Vqs, we construct the 
graph G'{Vq£). Each edge in G* is directed with respect to a;y-monotone order and is assigned a weight 
equal to the length of its portion that lies inside £. 

Theorem 4. Given two polygonal curves Ti and T2 in the plane, an arbitrary fixed e > and an 
approximation parameter 5 > 0, we can compute in O log (f )^ time a pari of parametrizations {cti, 0,2) 

and its quality Q^_^^^ forTi andT2, such that [1 — 5) {Ti,T2) < Q^^qj < Q^(Ti,T2). Furthermore, we 
can construct two polygonal curves, T[ and T2, realizing Q^^,, such that dp {T{, Tj) < e, if the distances 
between starting and ending points of Ti and T2 are not greater than e. 

Proof. From the above construction, Lemma |4] and Observation [3] it follows that for each pair of vertices v 
and w on d£, such that v\ w, there exists a directed path in G' {Vqs) joining v to vu, lying in £. Moreover, 
these vertices are connected to the boundary of their cells using the projected edges, and the cells are 
glued together at the projection points. Hence, there is an xy-monotone path tt* in G* from s to t. The 
fact that the cost of tt* is at least (1 — 5) of an optimum path follows from [9]. 
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Since \V3£\ = O (f), the size of G'iVos) inside each C»J is O (f log(f)). Hence, the size of G* is 
O (^^-j- log {j)^- Each edge of G* is directed to maintain xy- monotone order. Thus, it is a directed acychc 
graph and we can find a longest path in time linear in the size of the graph. □ 



5 Conclusion 



Our approach to measure the similarity between two polygonal curves in the presence of outliers is to 
minimize the portions of the curves where the matching is not possible (MinEx problem) or maximize the 
matched subcurves (Maxin problem). After proving that these problems are not solvable in the ACMQ, 
we designed approximation algorithms for the MinEx and MaxIn problems. What makes the solutions to 
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a) An arbitrarily small {Ti,T2). b) A path decision could be locally good, but maybe globally bad. 

Figure 7: a) A free-space diagram in which the weighted parts (red subcurves) of a weighted shortest a;j/-monotone path 
TTst are arbitrary small, compared to the length of Ti and T2. b) A weighted shortest subpath nab and the two possibilities 
TT^i, and Tv^ti to stay on G close to -Kab- Both solutions have a weight (sum of the lengths of the fat blue subpaths) much 
bigger than that of an optimal (sum of the lengths of the fat red subpaths). An additional intersection line, determined 
by an intersection point x of the boundary of an ellipse and a grid line, enables a path Tr^t, whose weighted subpaths are 
much smaller (sum of the lengths of the fat orange subpaths). 



the MinEx problem difficult to approximate is that, for any cell C"'^ with ellipse its forbidden space can 
be disconnected and the optimum path length may be negligible. But in the MaxIn problem, we benefit 
from the convexity oi £, as \d£\ = 0(1^1), where ui denotes the portion of the curves that are within 
Frechet distance of e in an optimal solution. This property gives us an opportunity to discretize d£ based 
on its length and S such that the summation of the errors along an a;y-monotone path will not be more 
than S\uj\. One of the open questions is whether it is possible to design a FPTAS for the MinEx problem. 
The canonical approach to achieve a multiplicative approximation error of (1 ± (S) is to place additional 
Steiner points, with "sufficient" density, based on S. Then, one could compute an xy-monotone weighted 
path in a neighborhood graph connecting these Steiner points. Such a task for the MinEx problem seems 
hard to achieve within reasonable running time. The main reason for this is that the sum of the parts of a 
shortest path n^t, being responsible for the weighted length of iTsi, could be arbitrary small compared to 
the length of the considered polygonal curves (Figure [t] shows an example). This implies that the number 
of required Steiner points could be arbitrary large and thus it seems hard to guarantee a running time 
which is polynomial in the input parameters. 
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